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^J . Abstract 

'■H i We consider a likelihood ratio method for testing whether a mena- 

ce | tone baseline hazard function in the Cox model has a particular value 

at a fixed point. The characterization of the estimators involved is pro- 
vided both in the nondecreasing and the nonincreasing setting. These 
characterizations facilitate the derivation of the asymptotic distribu- 
^. | tion of the likelihood ratio test, which is identical in the nondecreasing 

l/~) i and in the nonincreasing case. The asymptotic distribution of the like- 

0^ \ lihood ratio test enables, via inversion, the construction of pointwisc 

confidence intervals. Simulations show that these confidence intervals 
exhibit comparable coverage probabilities with the confidence intervals 
^T ■ based on the asymptotic distribution of the nonparametric maximum 

^-^ ' likelihood estimator of a monotone baseline hazard function. 
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1 Introduction 

In survival analysis, using Cox proportional hazards model [7] is the typ- 
ical choice to account for the effect of covariates on the lifetime distribution. 
Its attractiveness resides in its form, that allows for efficient estimation of 
the regression coefficient, while leaving the baseline distribution completely 
unspecified, see e.g., [I0j CEEl [20]. The regression coefficient estimator is 
the well-known maximum partial likelihood estimator [8]. As a response 
to Cox's paper, Breslow [7] proposed a different approach, that yields the 
same maximum partial likelihood estimator, along with an estimator for 



the baseline cumulative hazard function Ao. Impressive amount of research 
rapidly followed Cox's seminal paper, which primarily focused on deriving 
the (asymptotic) properties of the maximum partial likelihood estimator of 
the regression coefficient /3 n , as well as of the Breslow estimator A n of the 
baseline cumulative hazard function. 

Even though the baseline hazard Ao can be left completely unspecified, 
in practice, one might be interested in restricting Ao qualitatively. This can 
be done by assuming the baseline hazard to be monotone, for example, as 
suggested by Cox himself [7]. Various studies have indicated that a mono- 
tonicity constraint should be imposed on the baseline hazard, which complies 
with the medical expertise. For an illustration of a nonincreasing baseline 
hazard estimator in the study of patients with acute coronary syndrome, 
see |22j . Lopuhaa and Nane [17] propose a nonparametric maximum likeli- 
hood estimator and a Grenander type estimator for estimating a monotone 
baseline hazard function. The Grenander type estimator is defined in terms 
of slopes of the greatest convex minorant of the Breslow estimator A n . The 
two estimators have been proven to be strongly consistent and have been 
shown to exhibit the same distributional law. Furthermore, at a fixed point 
xq, the scaled difference between the maximum likelihood estimator A n (xo) 
and the true baseline hazard Ao(xo) converges in distribution to the distri- 
bution of the minimum of two-sided Brownian motion plus a parabola times 
a constant depending on the underlying parameters. These results adhere 
to the general nonparametric shape constrained theory, and, in particular, 
prolong naturally the findings in the case of the random censorship model 
with no covariates |14j . 

Ensuing inference will be pursued in this paper, by testing the hypothesis 
that the underlying monotone baseline hazard has a particular value 6q , at a 
fixed point xq. We will use a likelihood ratio test of Hq : \q{xq) = &o versus 
H\ : Ao(xo) 7^ 0q- Within the shape restricted problems, this approach was 
initially employed for monotone distributions in the current status model 
by Banerjee and Wellner [5]. The authors focused on deriving the limiting 
distribution of the likelihood ratio test under the null hypothesis, and to 
obtain what the authors referred to a fixed universal distribution, defined in 
terms of slopes of the greatest convex minorant of the two-sided Brownian 
motion plus a parabola. These findings were followed by a rapid stream of 
research, see, e.g., [SOS], that revealed that the likelihood ratio method 
could be extended straightforwardly in other shape constrained settings. 
In this paper, we carry on this research for the monotone baseline hazard 
function in the Cox model. In addition to extending directly the results 
in the right censoring model with no covariates in [3], we aim to provide a 



thorough description of the method and detailed proofs for all results. 

Furthermore, we will derive confidence sets for Ao(xo)> based on the like- 
lihood ratio method. More specifically, we will use that inverting the family 
of tests can yield, in turn, pointwise confidence intervals for the baseline 
hazard function. A more direct method of constructing pointwise confi- 
dence intervals is based on the asymptotic distribution, at a fixed point xq, 
of the nonparametric maximum likelihood estimator A n (xo), derived in [17] . 
Nonetheless, this entails the bothersome issue of estimating the nuisance 
parameter, and more specifically, estimating the derivative of the baseline 
hazard function A'(xo), since, to the author's best knowledge, there is no 
available smooth monotone estimator of the baseline hazard function in the 
Cox model. One option would be to kernel smooth the NPMLE A n , but this 
would pose extra difficulties, like an appropriate choice of a bandwidth. For 
a discussion of this issues in the case of right-censoring with no covariates, 
see [4]. 

The paper is organized as follows. Section [2] introduces the Cox model, 
the notations and the common assumptions. In Section O we introduce the 
likelihood ratio method and characterize the maximum likelihood estimator 
A n of a monotone baseline hazard function and the estimator A^, for which 
\n( x o) = $0) for a fixed xq in the interior of the support of the baseline dis- 
tribution. We provide the characterization of the two estimators in the case 
of both nondecreasing and nonincreasing baseline hazard functions Ao • The 
asymptotic distribution of the likelihood ratio statistic is provided, along 
with preparatory lemmas, in Section [4] Finally, Section [5] is devoted to 
constructing pointwise confidence intervals and comparing them, via simu- 
lations, with the conventional confidence intervals based on the asymptotic 
distribution of the NPMLE A n . 



2 Definitions and assumptions 

Suppose that the observed data consist of the following independent and 
identically distributed triplets (Tj,Aj,Zj), with i = l,...,n. The event 
time, denoted by X and commonly referred to as the survival time is sub- 
ject to random censoring. Thus, T = min(X, C), where T is the follow-up 
time and C denotes the censoring time. The indicator A = {X < C} marks 
whether the follow-up time is an event or a censoring time. Finally, Z 6 R p 
denotes the covariate vector of the observed follow-up time T, which is as- 
sumed to be time invariant. The event time X and censoring time C are 



assumed to be conditionally independent, given the covariate vector Z. Fur- 
thermore, let F be the distribution function of the non- negative random vari- 
able X, G the distribution function of the non- negative random variable C, 
and H the distribution function of T. The distribution function F{x\z) is 
assumed to be absolutely continuous, with density f{x\z). Similarly, the 
distribution function G{c\z) is assumed to be absolutely continuous, with 
density g{c\z). In addition, F(x\z) and G{c\z) share no parameters, thus 
the censoring mechanism is assumed to be non-informative. 

Let X(x\z) be the hazard function for an individual with covariate vector 
z eW. The Cox model [7] specifies that 

\{x\z) = \ (x)eP'o z , (2.1) 

where Ao represents the baseline hazard function, that corresponds to z = 0, 
and /3o £ R p is the vector of the underlying regression coefficients. Finally, 
we consider the following assumptions, that are typically employed when 
deriving large sample properties of estimators within the Cox model (e.g., 
see HE])- 

(Al) Let tf,tq and th be the end points of the support of F, G and H 
respectively. Then 

(A2) There exists e > such that 



< 00, 



sup E 

|/3-A)|^e 

where I • I denotes the Euclidean norm. 



3 The likelihood ratio method and the character- 
ization of the estimators 

By definition, A(x|2:) = — log(l — F{x\z)) is the cumulative hazard function. 
Thus, from (|2.1[) . it follows that A(x|z) = Ao(x) exp(/?Qz), where Ao(x) = 
Jq Ao(n) du is the baseline cumulative hazard function. Since, for a continu- 
ous distribution, A(i) = /(t)/(l — F(t)), for t ^ 0, the full likelihood is given 
by 
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As the censoring mechanism is assumed to be non- informative, and by (|2.ip . 
maximizing the full likelihood is the same as maximizing 



YlHTi I ^) A *exp[-A(7- | Zi)] = fl ho(Ti)^ Zi 
i=i i=i 
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which yields the following (pseudo) loglikelihood function, written as func- 
tion of fi e W and Aq 



^[AilogAoCTO + Ai/S'Zi- 

i=l 



/9% 



Ao(T 4 ) 



Let Tm < T^ < 



(i) 



(2) 



< 



T( n ) be the ordered follow-up times and, for 



l,...,n, let A(j) and Z^ be the censoring indicator and covariate vector 
corresponding to T^y Writing the above (pseudo) likelihood as a function 
of fi and Aq gives 



n r „1 

Lp(\o) = ^ A (i) log A (%) + A (4) /3'Z (4) - e^'%> 

i=i L Jo 



(•) 



Xq(u) du 



• (3-1) 



Following the approach in |17| . we do not proceed with the joint maximiza- 
tion of (|3.ip over fi and monotone Ao- Alternatively, for fi e R p fixed, 
we consider maximum likelihood estimation of a monotone baseline hazard 
function Ao and denote the estimator by X n (x; fi). Afterwards, we simply re- 
place fi by fi n , the maximum partial likelihood estimator ( see, e.g., [3 [8]) of 
the underlying regression coefficients fio, due to its commendable asymp- 
totic properties (see, e.g., [TO], [181 120] )• The proposed NPMLE is thus 
X n (x) = A n (x; /3 n ) and will be referred to as the unconstrained estimator 
of a monotone Ao- Furthermore, for fi e W fixed, we maximize the loglikeli- 
hood function L / g(Ao) in (|3.ip over the class of all monotone baseline hazard 
functions, under the null hypothesis Hq : Xq{xq) = 9o, for xq e (0, th) and 
9q e (0, oo), fixed. We obtain X n (x;fi) and hence propose X n (x) = X n (x;fi n ) 
as the constrained NPMLE. 

Replacing fi by fi n also in the loglikelihood function (|3.ip yields the 
likelihood ratio statistic for testing Hq : Ao(£o) = #o> 



2logUOo) = 2L, (X n )-2L g (A°). 



(3.2) 



Thus, for computing the likelihood ratio statistic, we need to characterize 
the unconstrained NPMLE X n (x) and the constrained NPMLE X n (x) of a 
monotone baseline hazard function Aq- 



3.1 Nondecreasing baseline hazard 

We consider first maximum likelihood estimation of a nondecreasing 
baseline hazard function Ao- Both the unconstrained estimator A n and the 
constrained estimator A„ will be characterized in terms of the processes 

W n (P,x)= f (eP' z F{u> S }ds) dP n (u,S,z), (3.3) 

and 

V n (x) = (5{u<x}dP n (u,5,z), (3.4) 

with (3 6 R p and x > 0, and where P n is the empirical measure of the 
(Tj,Aj,Zj), with i = 1, ...,n. The characterization of the unconstrained 
estimator A n (x;/3) has already been provided in Lemma 1 in [17j . which 
we restate below. Furthermore, we provide a closed form of the estimator 
on blocks of indices on which the estimator is constant. This expression 
will be useful in deriving the asymptotic distribution of the likelihood ratio 
statistic. 

LEMMA 3.1. Let Try < . . . < Tr n \ be the ordered follow-up times and 
consider a fixed f3 e R p . 

(i) Let W n and V n defined in ([H]) and i$SM- Then, the NPMLE \ n (x; f3) 
of a nondecreasing baseline hazard function Ao is of the form 

r x<T (1) , 
X n (x;/3) = < A» T (i) < x < T (i+1) , for i = 1, . . . ,n- 1, 

^oo x ^ T (n) , 

where \ is the left derivative of the greatest convex minorant (GCM) 
at the point Pi of the cumulative sum diagram (CSD) consisting of the 
points 

Pj = (w n (J3,T u+1) ) - W n (J3,T {1) ),V n (T m) j) f (3.5) 

for j = 1, . . . , n — 1 and Pq = (0, 0) . 

(ii) For k > 1, let Bi,...,Bf. be blocks of indices such that X n (x; /3) is 
constant on each block and B± u . . . u B^ = {1, . . . ,n — 1}. Denote by 
v n j(/3) the value of \ n (x; f3) on block Bj. Then, 

v njW) = = p= 1vra ^-- (3.6) 

SieS, R.+l) - T { i)\ 1,1=4+1 e ' (0 



Proof. The proof of (i) has been provided by Lemma 1 in [T7J . The NPMLE 
A n (x; (3) is obtained by maximizing the (pseudo) loglikelihood function in (j3.ll 
over all ^ Ao(T( 1 )) ^ . . . ^ Ao(T( n )). As argued in [17], the estimator has 
to be a nondecreasing step function, that is zero for x < Tn\ , constant on the 
interval [T(i\,T( i+1 \), for i = 1, . . . , n — 1 and can be chosen arbitrarily large 
for x ^ Tt n y Then, for fixed (3 e R p , the (pseudo) loglikelihood function 
in (|3.1|) reduces to 

Th — 1 Tl % — 1 

^ A (J) log A (T (J) ) - g e^M ^ [Ty +1) - %] A (%) 

, (3.7) 



n-l 



2 A (0 log A (T W ) - A (T W ) [T (4+1) - T (i) ] £ ^' 

i=l I Z=i+1 



(?) 



Let Aj = Ao(T(j)), for % = 1, ...,n— 1 and A = (Ai, . . . , A n _i). Then, finding 
the NPMLE reduces to maximizing 



n-l 



0(A) = 2 j A (4) log A, - A, [T (i+1) - T (i) ] 2 e"' z (0 [ , (3.8) 

i=l I J=t+1 J 

over the set < Ai < . . . < A n _i. The NPMLE corresponds thus to a 
vector A = (Ai, . . . , A n -i) that maximizes <f> over < Ai < . . . < A n _i. To 
prove (ii), we first derive the Fenchel conditions of the estimator. Thus, we 
will show that the estimator X n (x; (3) maximizes the (pseudo) loglikelihood 
function in (|3.1[) over the class of nondecreasing baseline hazard functions if 
and only if 



Pb'+i) " %] £ e/3 ' Z(0 \ < °> (3-9) 




for i = 1, . . . , n — 1, and 



n— 1 I a n 

E T^-[ r (i + D-%] E e^(^A i = 0. (3.10) 

j=l [ A J l=j+l 

The NPMLE A n (x;/3) is thus uniquely determined by these Fenchel condi- 
tions. The rest of the proof focuses on deriving the Fenchel conditions f)3.9f) 
and (|3.10p and on establishing (|3.6[) . 

First, note that the function <f> in f|3.8[) is concave and that the vector of 
partial derivatives V0(A) = (Vi^(A), . . . , V n _i0(A)) is given by 



V0(A) 



(^f - [r (a) - r (1) ] 2 e^(o, . • • , ^=f - [r (B) - r (B _ 1} ] e^oo) 



Define now the functions gi(X) = Aj_i — Aj, for i = 1, . . . , n — 1 and Ao = 0, 
and the vector g(X) = (51(A), . . . , <? n _i(A)). Moreover, define the matrix of 
partial derivatives by 

Let </>(A) = —cj)(X). Then, maximizing (|3.8p over all ^ Ai ^ . . . < A n _i is 
equivalent with minimizing 4>(X) under the restriction that all components 
of the vector g(X) are negative. An adaptation of the Karush-Kuhn- Tucker 
theorem (e.g., see Theorem 8.1 in [12]) states that A minimizes <f> over all 
vectors A such that gi(X) < 0, for all i = 1, ... ,n — 1, if and only if the 
following conditions hold 

V0(A) + G T a = 0, (3.12) 

g(X)+W = 0, (3.13) 

(a,w) = 0, (3.14) 

for a = (ai, . . . , a n _i), with on > 0, i = 1, . . . , n—1 and w = (i«i, . . . , w n -i), 
with iOj > 0, for i = 1, . . . ,n — 1. The first condition (|3. 12[) . yields that 

<* = - 2 V^(A) = " S j ^ - [T W+1) - %] 2 e^« 1 . (3.15) 

Since c^ ^ 0, for all i = 1, . . . , n—1, condition (J3.9H is immediate. From (I3.13p . 
w = —g(X) = (Ai — Ao, ■ • • , A n _i — A n _2), with Ao = 0. Note that the condi- 
tion Wi > implies that Aj_i < Aj, for all i = 1, . . . , n — 1, which is trivially 
satisfied. Finally, by (J3T4"]) . 

n-l 

2(A i -A i _i)2v^(A) = 0, 

which re- writes exactly to ()3.10p . 

To derive the expression in (|3.6h . we prove first that (|3.9p and fj3. 10f) 
imply that 



n-l [ 



2 \^-[T U+1) -T U) ] 2 e^CO =0. (3.16) 

Condition (|3.9p gives that Xi?=i ^j^W ^ 0- I n addition, as the maximizer 
A is nondecreasing, 

n-l 

Ai 2 V^(A) = - V 2 <A(A)A 2 - V 3 <A(A)A 3 - ... - V n _i0(A)A„_i 
+ V 2 0(A)Ai + V 3 ^(A)Ai + . . . + V n _i^(A)Ai 

n-l 

= ^ (Ai-i - A*) 2 V^(A) ^ 0. 

i=2 j>i 

This shows (|3.16p . Now let Bi, . . . ,Bk be blocks of indices on which A is 
constant such that B\ u . . . u Bk = {1, ... ,n — 1} and let v n j(/3) be the 
value of A on the block Bj, with j = 1, . . . , k. If k = 1, then the expres- 
sion of v n i is immediate from ()3.16p . Moreover, observe that, by (|3.14p . 

2i=i «i (Ai — Aj_i J = 0, and since ctj ^ and Aj > Aj_i, for any i = 

l,...,n — 1, it will follow that «j = 0, whenever Aj_i < Aj. Hence, for 
k ^ 2, there exist k — 1 a's that are zero. Then (13.6P follows by (13.151) 
and (|3.16p . For example, for k ^ 3, choose any two consecutive a, that are 
zero. From (I3.15p . we get that by subtracting these c^'s, 

As v n j([3) is constant on .Bj, this yields ()3.6[) . D 

As mentioned beforehand, the proposed unconstrained estimator is thus 
An (a;) = X n (x;f3 n ). Equivalently, on each block of indices Bj, for j = 
l,...,k, we propose the estimate v n j = v n j(/3 n ). Under the null hypoth- 
esis Hq : Xq(xq) = 6q, the characterization of the constrained maximum 
likelihood estimator A^ is provided by the next lemma. 

LEMMA 3.2. Let xo £ (0,r#) fixed, such that T/ m \ < xo < T( m+ i), for a 
given 1 < m < n — 1 . Consider a fixed (3 e R p . 

(i) For i = 1, . . . , m, let Xf be the left derivative of the GCM at the point 
P^ of the CSD consisting of the points P- = Pj, for j = 1, ... ,rn, with 



Pj defined in (|3.5p and Pq = (0, 0) . Moreover, for i = m+ 1, . . . , n— 1, 
let \i be the left derivative of the GCM at the point P^ of the CSD 
consisting of the points P^ = Pj, for j = m, . . . , n — 1, with Pj defined 

in (|3,5p . Then, for 6q e (0, oo), the NPMLE A°(x) of a nondecreasing 
baseline hazard function Ao, under the null hypothesis Hq : Ao = 9o, is 
of the form 



A°(x;/3) 





A? 



A 



o 

III: 





x<T (1) , 

T(i) sc x <% fl) , forie {1, . 

T(rn) ^x <x , 

x < x < T (m+1) , 



,n — l}\{m} 



X : 



x >T, 



(n), 



( 3 - 17 ) 
where A? = min(Af , #o)> /or i = 1, . . . , m, and A° = max(Af , #o), /< 



o;- 



m + 1, . . . ,n — 1. 



(mJ For k > 1, Ze£ B®,...,B® be blocks of indices such that A^(x;/3) is 
constant on each block and B®vj . . .vj B9 = {l,...,ra — 1}. Then, there 
is one block, say B®, on which X^(x; j3) is equal to 0$, and one block, 
say Bp, that contains m. On all other blocks Bj, denote by fJL(/3) the 
value o/A°(x;/3) on block Bj. Then, 



vim 



2ieB? \i) 



YuieB , [ T (i+1) ~ T (i)] Z?=i+1 ^'' 



'(0 



(3.18) 



for j = 1, . . . ,p — l,p + 1, . . . , k. On the block B® that contains m, 



V°nM 



S, 



eS° A (0 



SiGBO\{m} [ T (i+1) ~ T «] ELi+1 e/3 ' Z(0 + [ x - T (m) ] 2 



(3.19) 



P /3'Z, 



(i) 



Proof. We will derive the Karush-Kuhn- Tucker (KKT) conditions, that uniquely 
determine the constrained NPMLE, and which implicitly provide the char- 
acterization in (ii). To prove the lemma, we will show that the estimator 
proposed in (i) satisfies these conditions. 

The constrained NPMLE estimator is obtained by maximizing the objec- 
tive function (|3.ip over sc A (T( 1 )) < . . . < A (T( m )) < 9 < A (T( m+1 )) < 



10 



. . . ^ Ao(T( n _ 1 \). In line with the reasoning for the unconstrained estima- 
tor, it can be argued that the constrained estimator has to be a nonde- 
creasing step function that is zero for x < Tm, constant on \Tu\,Tu + \\), for 
i = 1, . . . , n— 1, is equal to 9q on the interval [xq, T(, m+1 \), and can be chosen 
arbitrarily large for x > Ti n \. Therefore, for a fixed (3 e R, the (pseudo) 
loglikelihood function in (|3,ip reduces to 

to— 1 f n "J 

^ ]A (i) logAo(T (0 )-Ao(r (i) )[r (i+1) -r (0 ] ^ e^'M 

(I 



' z (0 



A (m) log A (T (m) ) - A (T (m) ) [x - T (m) ] J] e^' 

Z=m+1 

n 

^[r (m+ i)-x ] 2 e/3 ' Z(0 

n-i r 

Yi \ A (i) lo g A o( T (i)) " x o(T(i)) [T(i+i) - %] 2 e 



(3.20) 



P' z m 



By letting Aj = Ao(T/^), for i = 1, . . . ,n— 1 and A = (Ai, . . . , A n _i), we then 
want to maximize 

to— 1 f n \ 

/(A) = 2 | A (i) log A, - A, [T {1+1) - T (i) ] 2 e^' z (0 



A (m) logA m -A m [x -T (m) ] J] e^w (3.21) 

n—X f n ~\ 

2 A (i) logA i -A i [r (m) -T (i) ] ^ ^ 



'%) 



j=m+l ^ Z=i+1 



over the set < Ai < . . . < A m < 6q ^ A m+ i < ... < X n -i- Let the 
vector A c = (Af , . . . , X^-i) denote the constrained NPMLE under the null 
hypothesis Hq : Xo(xq) = 9q. We will show next that A c maximizes the 
objective function in (|3.2ip over the class of nondecreasing baseline hazard 
functions, under the null hypothesis, if and only if the following conditions 
are satisfied 

{A n 

-^ ~ [%H) " T U)] E e/3 ' Z(i ' \>°> for i = 1, . . . , m - 1, 
A 3 1=3+1 J 

(3.22) 
11 



m— 1 ( a n 



A (m) 

A c 

x rn l=m+l 



(3.23) 



|>o-r (m) ] 2 e^ z (0>0, 




[T (j+1) - T (j) ] ^ e^« [ < 0, for i = m + 1, . . . , n - 1, 

(3.24) 



and 



n— 1 f a n 1 

^ m (3.25) 

The NPMLE A c is thus uniquely determined by these conditions. To prove 
(i), we will show that A^ defined in ()3.17|) verifies the Karush-Kuhn- Tucker 
(KKT) conditions (|3.22|) - (|3.25[) . Therefore, A^ is the unique maximizer of 
0°(A) in (JIOT]) . over the set sS Ai sS . . . < A m < 6 < A m+i < . . . < A n _i. 
As it will be seen further, despite bothersome calculations, the distinct form 
of the likelihood grants a unified framework for deriving the KKT conditions, 
that uses all the follow-up times, unlike the reasoning in [5], where the 
(pseudo) loglikelihood is split and arguments are carried both to the left 
and to the right of xq . 

Similar to the unconstrained case, observe that the function cjp is concave 
and that the vector of partial derivatives is V^°(A) = (Vi</>°(A), . . . , V n _i^>°(A)), 
with 

A n 

V,*°(A) = -f- - [T (i+1) - T (i) ] 2 e? z a>, 

l=i+l 

for i = 1, . . . ,m — l,m + 1, . . . , n — 1, and 



A n 

V m /(A) = ^-[* -T (m) ] 2 e^ 



Note that the form of V m </>°(A) differs from the form of Vi</>°(A), for i = 

1, . . . ,771—1,771+1, . . . ,ri—l. Moreover, define the vector g(X) = (g±(X), . . . , g„_i(A)), 

with 



12 



9iW 



Aj - Aj+i i= l,...,m — 1, 

A m - Oo i = m, 

Oo-^m+i i = m + l, 

Aj_i — Aj i = m + 2, . . . , n — 1, 



and consider the matrix of partial derivatives defined in (|3.1ip . Computa- 
tions as in (J3.15P can be derived to show that condition (|3.12p yields (13. 22|1 - 
(|3.24p . upon noting that 



a; 



E^V^°(A C ) 



i = 1, ... ,771, 

i = m + 1, . . . ,n — 1. 



(3.26) 



Condition (|3.13p gives that w = (A£ — Af , . . . , #0 — A m , A m+1 — 0o> • • • > A n _x — 
A£_ 2 ), which together with (gUD and (ET26]) . yields (ET25]) . Moreover, (pTHl) 
gives that 



m— 1 



i=l 



y; 



Oim.+l [Kn+l 



ra-1 



m+2 



2 a '' ( 



j-1 



0. 



Obviously, a^ = if A| < Af +1 , for i = 1, . . . ,m—l,m+l, . . . , n— 1 and A3. 18f) 
can be derived as in the proof of Lemma 13.11 For the block B® containing 
77i, we get that 



S 



A 



(0 



, v° (0) 

i£B0\{ m } V U np\P) 



[ T d + D - r »] 2 ^"' 



(I) 



l=i+l 



A 



+ 



(m) 



V° np (0) 



[xo-T (m) ] £ e? z W=0, 



Z=m+1 



which gives exactly (J3.19P . Therefore showing that the estimator A^ defined 
in (|3.17p satisfies the KKT conditions ()3.22[) - (j3.25|) also proves (ii). 

Recall that A^ is min(A^, 6q), for i = 1, . . . , m, and that A^ is the uncon- 
strained estimator when considering only the follow-up times Tn\, . . . , Ti m \. 
Moreover, A^ is max(A[%#o)) f° r i = m + 1, ... ,n — 1, where A^ can be 
viewed as the unconstrained estimator when considering only the follow-up 
times Tf m \, . . . , TL-i)- Note that f|3. 16|) together with (|3.9p imply that 
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EX 1 - [ T U+D - r W ] £ e^«> L 0, for i = 1, . . . ,n - 1. 

i<* [ A J 1=3+1 J 

(3.27) 
The condition holds for i = 1, . . . , m — 1, and, moreover, 

A n 

^. ^mw.(\f,9 ) l=j+1 

for i = 1, . . . ,m — 1. Therefore, min(Af ,9), for i = 1, ... ,m — 1 satis- 
fies (|3.22p . Furthermore, (|3.2T[) holds for i = m, which implies that 



771—1 A n 

E — 7#T7-PW>- r <,>] E-" 



im(Af,0 o ) 



A n 



h-r M ] ^ e^ 



min(A^,0 o ) i=i+; 



m a n 



i=i [mm(A|,#o) , =i+1 J 

hence A„ satisfies ()3.23|) as well. It is straightforward that max(A^,#o)) f° r 
i = m + 1, ... ,n — 1 satisfies (|3.24p . since, by definition, A^ satisfies (j3.9|) . 
for i = m + 1, . . . , n — 1, and 

f A n 1 



A 



Lax(Af,6» ) , =i+1 



^^-[^• + D-%] S ^H°- 



J>« I 3 l=j+l 



Finally, to check if A^ verifies the condition (|3.25p . we will argue on the 
blocks of indices on which \ n , and hence Xf and A^ are constant. By (13.6p . 
for each block Bj, with j = 1, . . . , k, on which the unconstrained estimator 
has the constant value v n j(/3), 
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and 



2 ^-[r« + i.-r w ] S^-Uo. 

ieB d { Vn iyP> i= l+ i ) 

Then, on each block Bj that does not contain m, we can write 

Z=i+1 J 



jgB, I A * J=i+1 



(3.28) 



and this holds for Xf, as well as for A^ . It is straightforward that mm(\f,9o), 
for i = 1, . . . , m and max(A^, 6q), for z = m + 1, . . . , n — 1 satisfy this 
relationship. For the block B p that contains m, we have 

iGB p \{m} I A « J=i+1 J 

(A n n 1 

%* - [T (m+1) - x ] 2 e^co _ [xo _ T(m) ] ^ e^« A^ 
A m l=m+l l=m+l J 

iGB p \{m.} I A i f=i+l J 

(A n n 1 

^ - [T (m+1) - xo] 2 e^>-[* -T (m) ] 2 e^'%) . 
Am l=m+l l=m+l J 

Constraining A^ to be 9q on the interval [%o,T(m+i)) yields 
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[xo-T {m) ] f; e^«W 

l=m+l J 

n 



(3.29) 



Z=i+1 



-h-T (m) ] 2 e^«> 

l=m+l 



Once more, for i 6 B p , min(A^,#o) satisfies this relationship. Summing over 
all blocks in (|3.28p and (j3.29p completes the proof. □ 



Similar to the unconstrained estimator, we propose A°(x) = A„(x;/3 n ) 

5°. =v°( ' 

J n] "nj ' 



as the constrained estimator and v®a = v^Af3 n ), where j3 n is the maximum 



partial likelihood estimator. 

REMARK 3.1. As already pointed out in \1T^ , if we take all covari- 
ates equal to zero, the characterization of the unconstrained estimator dif- 
fers slightly from the characterization of the nondecreasing hazard estimator 
within the ordinary random censorship model, provided in \11$ . Correspond- 
ingly, the characterizations in Lemma \3.1\ and \3.S\ with all Z\ = differ 
from the characterizations furnished in J4}j. Although the estimators in J4}i do 
not maximize the (pseudo) loglikelihood function in (|3.1j) (in the absence of 
covariates and under the null hypothesis) over nondecreasing Xq, the asymp- 
totic distribution of the likelihood ratio test based on these estimators will 
coincide with our proposed distribution, in the case of no covariates. 

Using the notations in [3], let slogcm(/, /) be the left-hand slope of the 
greatest convex minorant of the restriction of the real- valued function / to 
the interval /. Denote by slogcm(/) = slogcm(/, R). Moreover, let 

slogcm°(/) = min (slogcm(/, (-oo,0]),0) l ( _ 00>0] +max(slogcm(/, (0,oo)),0) 1 (0]00 ). 

Furthermore, for positive constants a and b, define 

X a , b (t) = aW(t) + bt 2 , (3.30) 

where W is a standard two-sided Brownian motion originating from zero. 
Let 



1(3 



9a,b(t) = slogcm(X a>b )(t), (3.31) 

the left-hand slope of the GCM G a ^ of the process X a ^, at point t. The 
constrained analogous is defined as follows: for t < 0, construct the GCM 
of X a j,, that will be denoted by G^ b and take its left-hand slopes at point 
t, denoted by Di J (X at b)(t). When the slopes exceed zero, replace them by 
zero. In the same manner, for t > 0, denote the GCM of X a ^ by G^ b and 
its slopes at point t by Dn{X a ^){t). Replace the slopes by zero when they 
decrease below zero. This slope process will be denoted by g® b , which is 
thus given by 

(mm{D L (X a)b ){t),0) t < 0, 
g%(t) = I t = 0, (3.32) 

[max(D R (X a>b )(t),0) t > 0. 

Note that for t < 0, there exists, almost surely s < such that Di(X a fi)(s) 
is strictly positive for any point greater than or equal to s and the left 
derivative at s is non-positive. Equivalently, for t > there exists almost 
surely s > such that D/j(X aj b)(s) is strictly negative for any point smaller 
than or equal to s and the left derivative at s is non-negative. In addition, 
observe that g^ b {t) = slogcm (X a ^)(t), as defined and characterized in [5]. 

3.2 Nonincreasing baseline hazard 

The characterization of the unconstrained and the constrained NPMLE 
estimators of a nonincreasing baseline hazard function follows analogously 
to the characterization of the nondecreasing estimators. The unconstrained 
NPMLE X n (x; (3) is obtained by maximizing the (pseudo) likelihood function 
in flinD over all A (T (1) ) > ... > A(T (n) ) ^ 0. As derived in [T7], the 
likelihood is maximized by a nonincreasing step function that is constant on 
(T(i_i\,Tri\], for i = 1, . . . ,n and where Tr \ = 0. The (pseudo) loglikelihood 
in (|3.ip becomes then 

n ( n \ 

^ ] A (J) log A (T (i) ) - A (T (0 ) [T (i) - r (i _ 1} ] Je^w . (3.33) 

i=l I l=i ) 

The lemmas below provide the characterization of the unconstrained esti- 
mator \ n (x;(3) and the constrained estimator A„(x;/3). Their proofs follow 
by arguments similar to those in the proofs of Lemma 13.11 and Lemma 13.21 
as well as the necessary and sufficient conditions that uniquely characterize 
these estimators.. 
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LEMMA 3.3. Let Tm\ < . . . < TL) be the ordered follow-up times and 
consider a fixed /3 e R p . 

(i) Let W n be defined in (J3.3J) and let 

V n {x) = [5{u^ x}dF n (u,5,z). (3.34) 

Then, the NPMLE \ n (x; (5) of a nonincreasing baseline hazard function 
Ao is given by 

\ n ( X ;P) = i k r (i _ 1) <x<T w ,A W -i-l,...,n l 

(0 x>T {n) , 

for i = 1, . . . , n, with TVq) = and where Aj is the left derivative of the 
least concave majorant (LCM) at the point Pi of the cumulative sum 
diagram consisting of the points 

P j = (w n ((3,T U) ),V n (T ij) )), (3.35) 

for j = 1, . . . , n and Pq = (0, 0) . 

(ii) Let B\, . . . , Bf- be blocks of indices such that X n (x; j3) is constant on 
each block and B\ u . . . u B^ = {1, . . . , n}. Denote by v n j((3), the value 
of the estimator on block Bj . Then 

Vnj(P) 



In fact, for x ^ Tr n \, X n (x; f3) can take any value smaller than A n , the left 
derivative of the LCM at the point P n of the CSD. As before, we propose 
An(x) = \ n (x; j3 n ) as the estimator of Ao and v n j = v n j(/3 n ), where J3 n 
denotes the maximum partial likelihood estimator of (3q . Fenchel conditions 
as in (|3.9p and (|3. 10[) can be derived analogously. 

The NPMLE estimator A^ maximizes the (pseudo) loglikelihood function 
in (I3.33P over the set A (T (1) ) ^ . . . ^ A (T (m )) > 9 Ss A (T( m+1) ) > . . . Js 
\o(Tr n \) ^ 0. It can be argued that the constrained estimator has to be a 
nonincreasing step function that is constant on {T(i_i\,Tu\\, for i = 1, . . . , n, 
is #o on the interval (T( m ),xo], and is zero for x > T^ n y Hence, the (pseudo) 
loglikelihood function becomes 
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' z m 



' z (i) 



2 A (i) logA (%) - Ao(T (4) ) [T (0 - T^^^e^' 

i=l I J=i 

+ A (m+1 ) log Ao(T( m+ i)) - 6»o [x - T (m)] 2 e ^ 

Z=m+1 
n 

- A (T (m+1) ) [T (m+1) - xo] 2 e^o 

n ( n "j 

+ 2 | A (i) log Ao(T (i) ) - A (T (i) ) [T (i) - r (i _ 1} ] 2 e^(0 . 

The characterization of the constrained NPMLE A^ is provided with the 
next lemma. 

LEMMA 3.4. Let xq e (0, Tff) /ixed, suc/i t/iat T/ m ) < xo < Tr m+1 \, for a 
given 1 < m ^ n — 1 . Consider a fixed (3 e R p . 

(%) For i = 1, . . . , m, let Xf 1 to be the left derivative of the LCM at the point 
Pj L of the CSD consisting of the points P L = Pj, for j = l,...,m, 
with Pj defined in (|3.35p , and P L = (0,0). Moreover, for i = m + 
1, . . . , n, let Xj* be the left derivative of the LCM at the point P^ of the 
CSD consisting of the points P- = Pj, for j = m, m + 1 . . . , n, with 

Pj defined in (|3.35p . Then, the NPMLE A^(x;/3) of a nonincreasing 
baseline hazard function Xq, under the null hypothesis Hq : Xq = 9$, is 
given by 

A° T (i-i) < x < T (i ), for i = 1, . . . ,m,m + 2, . . . ,n, 

#o T( m ) < x < xq, 

A^ +1 xq <z<T (m+1 ), 



X° n (x-(3) 



x > T ( „) , 



(3.36) 
where T( ) = and where A° = max(Af,6>o), for i = l,...,m, and 
A° = min(Af ,9q), for i = m + 1, . . . , n. 

(mJ For k ^ 1, let B®,. . . ,B? be blocks of indices such that A^(x;/5) is 
constant on each block and B® u . . . u B® = {1, . . . , n}. There is one 
block, say B®, on which A°(x;/3) is 9q, and one block, say B®, that 
contains m + 1. On all other blocks B^j, denote by v^J/3) the value of 
A°(x;/3) on block B^. Then, 
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n , , HfeB° A (0 

vim - 



TiieB] [ T U) ~ T 0'-l)] 2"=j ^' 
On the block B®, that contains m + 1, 

V° np (f3) 

^teB« A (i) 



Evidently, we propose A°(x) = A°(x;/3 n ) as the constrained estimator 
of a nonincreasing baseline hazard function Ao, as well as v^ ■ = u{L(/3n) on 
blocks of indices where the estimator is constant. The Fenchel conditions 
corresponding to ()3.22[) - (j3.25[) can be derived in the same manner as for the 
constrained estimator in the nondecreasing case. 

Let slolcm(/, /) be the left-hand slope of the LCM of the restriction of the 
real- valued function / to the interval I. Denote by slolcm(/) = slolcm(/, H). 
For a, b > 0, let X a ^(t) = aW(t) — bt 2 , where W is a standard two-sided 
Brownian motion originating from zero. Denote by L ab the LCM of X a ^ 
and let 

l a)b {t) = slolcm(X aj6 )(i), (3.37) 

be the left-hand slope of L a ^, at point t. Additionally, set 

slolcm°(/) = max (slolcm(/, (-oo,0]),0) l ( _ lX , i0 ]+min (slolcm(/, (0,oo)),0) l(o,oo)- 

For t ^ 0, construct the LCM of X a b, that will be denoted by L^ b and take 
its left-hand slope at point t, denoted by DL{X a ^){t). When the slopes fall 
behind zero, replace them by zero. In the same manner, for t > 0, denote 
the LCM of X a> i, by L^ b and its slope at point t by Dj^(X a fi){t). Replace 
the slopes by zero when they exceed zero. This slope process will be denoted 
by I® b , which is thus given by 



Observe that l° b (t) = slolcm°(A > ajfe )(i). 



max( J D L (A > a>6 )(t),0) t < 0, 

t = 0, (3.38) 

mm(D R (X atb )(t),0) t > 0. 



20 



4 The limit distribution 

Let B[ oc (JR) be the space of all locally bounded real functions on R, equipped 
with the topology of uniform convergence on compact sets. In addition, 
C m j n (R) is defined as the subset of £>; oc (R) consisting of continuous func- 
tions / for which f{t) — ► co, when \t\ — ► oo and / has a unique minimum. 
Let C be the space of locally square integrable real- valued functions on R, 
equipped with the topology of L2 convergence on compact sets. 
For a generic follow-up time T, consider H uc {x) = P(T ^ x,A = 1), the 
sub-distribution function of the uncensored observations. Moreover, let 

.«.,«)-/(.>,} ^^4,). (4.1) 

for f3 e R p and x e R, where P is the underlying probability measure 
corresponding to the distribution of (T, A, Z). For a fixed point xq e (0, tjj), 
define the processes 



X n (x) = n 1 / 3 [X n (x + n-Wx) - 9 , 

; ( (4-2) 

Y n {x) = n 1 / 3 (X° n (xo + n-^x) - Q ) . 

The following lemma provides the joint asymptotic distribution of the above 
processes. 

LEMMA 4.1. Assume (Al) and (A2) and let xo e (0, t#). Suppose that X 
is nondecreasing on [0, 00) and continuously differentiable in a neighborhood 
of xq, with Ao(xo) 7^ and Aq(:£o) > 0. Moreover, assume that the functions 
x 1— > <£(/3o,x) and H uc {x), defined in (I4.ip and above (14. lj) . are continuously 
differentiable in a neighborhood ofxQ. Finally, assume that the density of the 
follow-up times is continuous and bounded away from zero in a neighborhood 
of xq • Define 



and b = h' (x ). (4.3) 



\/$(/3o,xo) 2 

Then (X n ,Y n ) converges jointly to (g a ,b,9ab)> i- n £ x £-, where the processes 
g a fi and g® b have been defined in (|3.3ip and (|3.32p . 

Proof. Note that the processes X n and Y n are monotone. By making use of 
Corollary 2 in |15j and the remark above the corollary, it suffices to prove 
that the finite dimensional marginals of the process (X n , Y n ) converge to the 
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finite dimensional marginals of the process (g a ,b,9ab)i m or der to prove the 
lemma. 

For x ^ T(i), let 

W n {x) = W n n ,x) - W n n ,T {1) ), 



where W n is defined in (|3.3p , and where (5 n is the maximum partial likelihood 
estimator. For fixed xq and x e [— k, k], with < k < oo, define the process 



Zn(x) 



■« 



2/3 



*(A)>zo) 



jV n (x + n 1/3 x)-K(x ) 



Ao(^o) f^n(x + n 1/3 2;) - W n (x ) 



(4.4) 



where V^ is defined in (|3.4|) . For a and b defined in (|4.3|) . Z„ converges 
weakly to X a ^, as processes in i^ oc (R), by Lemma 8 in [T7]. Define now 



S n (x) 



n 



1/3 



*(A),»o) 



W n (xo + n~ 1/3 x) - W n (x ) 



(4.5) 



From the proof of Lemma 9 in [17] . S n (x) converges almost surely to the 
deterministic function x, uniformly on every compact set. 

Following the approach in [11] . Lopuhaa and Nane [17] obtained the 
asymptotic distribution of the unconstrained maximum likelihood estimator 
X n by considering the inverse process 



(4.6) 



U n (z) = argmin \ V n (x) - zW n (x) \ , 

x-e[T {1) ,T (n) ] 



for z > 0, where the argmin function represents the supremum of times at 
which the minimum is attained. Since the argmin is invariant under addition 
of and multiplication with positive constants, it follows that 



n 



1/3 



U n (6 + n-^z) 



Xq 



argmin {Z n (x) - S n (x)z] 
xeI„(xo) 



where I n (xo) = [— n 1 ' 3 ^ — T^) , n 1 ' 3 (Tr n \ — Xo)]. For z > 0, the switching 
relationship X n (x) =S z holds if and only if U n {z) > x, with probability one. 
This translates, in the context of this lemma, to 



n 



1/3 



X n (x + n 1/3 x)-9 



«$ z 



n 



1/3 



U n (6 + n-^z) - x 



>x, 
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for < xq < th and 6q > 0, with probability one. The switching relationship 
is thus X n (x) < z <=> n 1 ' 3 \U n (6o + n~ 1 ' 3 z) — xo] ^ x. Hence finding the 
limiting distribution of X n (x) resumes to finding the limiting distribution 
of n 1 ' 3 \U n (0o + n~ 1 ' 3 z) — xq\. By applying Theorem 2.7 in [16], it follows 
that, for every z > 0, 



■/? 



V*\U n (9 + n-V :i z)-xo\±U(z), 
as inferred in the proof of Theorem 2 in [T7] , where 

U(z) = sup {t 6 R : X Qj b(t) — zt is minimal} . 
It will result that, for every x e [—k, k], 



P (X n (x) ^z)=P (n 1/3 A n (x + n" 1/3 x) - 6 < z 

= P (n 1/3 U n (9 + 7i~ 1/3 z) -xq >x) 
->P(U(z) >x). 



Using the switching relationship on the limiting process, it can be deduced 
that U{z) ^ x o g a ,b( x ) ^ z ) with probability one, and thus X n {x) — * 
9a,b{x). 

In order to prove the same type of result for Y n (x), consider first the 
following process 



Y n {x) = n 1 ' 3 (a„(x + n'^x) - 0„) 
where, for xq e (0, th), such that T( m ) < xq < T( m+1 ), 



(4.7) 







x < T, 



(i)' 



X n (x) = < 



\ T {i) < x < T (i+i)> for i = 1, . . . , m - 1 

^m T (m) < X < X , 

Xq SC X < T( m+1) , 

Af T(j) < x < T( i+1 ), for % = m + 1, . . . ,n — 1 



(«)' 

with A^ and A^ defined in Lemma 13.21 For this, we have considered up 
to xo an unconstrained estimator which is constructed based on the sample 
points Th\, . . . ,Tr m+1 \. Moreover, to the right of xq, we have considered 
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an unconstrained estimator based on the points T( m+1 ), . . . ,Tf n y It is not 
difficult to see that 



YJx) 



For z > 0, define the inverse processes 



min ( Y n (x),0 J x < 0, 
x = 0, 

max I y n (x),0 I x > 0. 



(4.8) 



C/ n (z) = argmin \ V n (x) - zW n (x) 

xe P(l) i^Cm+l)] 



ttfto 



argmm ^ Ki.(x) — zW n (x) 
a;e[T( m+1 ),T( n )] 



Take x < xo- The switching relationship for A n is given by A n (x) ^ z if and 
only if U^{z) > x, with probability one, which gives that 



7) 



1/3 



A n (x + n^ 3 x) - O ] < z & n 1 ' 3 \u^{6 Q + n - l l 3 z) - x 



> x, 



with probability one. Moreover, 
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1/3 



Un(6o + n l/3 z) - Xq = argmin {7L n {x) - S n (x)z} 



where I%(xq) = [-n 1/3 (x - T( 1 )),n 1 / 3 (T( m+1 ) - xq)]. Denote by 

Z n (z,x) = % n {x) - S n {x)z. 

As for the unconstrained estimator, we aim to apply Theorem 2.7 in [16] . 
As Theorem 2.7 in [16] applies to the argmax of processes on the whole real 
line, we extend the above process in the following manner 



x<-n 1 /3(x -r {1) ), 

-nV3( Xo _ T (1) ) < x < n l / 3 (T {m+1) - x ), 



r Z n (z,-nV 3 (x -T {1) )) 

Zn{z,x) = < Z n (z,x) 

^Z n (z,n l / 3 (T {rn+1) - xq)) + 1 x > n l l 3 (T(. m+l) - x ) 
Then, Z~(z,x) e Bi oc (R) and 
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1/3 



U^(9 + rT^z) - x 



argmin \Z n (z, x)j = argmax \—Z n (z, x)} . 

xeTR. xeTR. 
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Since Ao(xo) = #o > and Ao is continuously differentiable in a neigh- 
borhood of xq, it follows by a Taylor expansion and by Lemma 2.5 in [9] 
that n 1 ' 3 (T( m+1 ) — xq) = O p (n~ 1 logn). Therefore, by virtue of Lemma 
8 and Lemma 9 in |17| . the process x >—> —Z~(z,x) converges weakly to 
Z~{x) e C max (R), for any fixed z, where 



Z n (z, x) 



-X a h(x) 



1 



zx x < 0, 
x > 0, 



for a and b defined in f|4.3j) . Hence, the first condition of Theorem 2.7 in [16] 
is verified. The second condition follows directly from Lemma 11 in |17| . 
while the third condition is trivially fulfilled. Thus, for any z fixed, 



n 



1/3 



U^(B + n~ 1/3 z) - sol ^ tf~(«) 



where t/ (z) = sup {i < : X a ^{t) — zt is minimal}. Concluding, for x < 0, 



P y n (s) < z = P n 1 ^ A n (x + n-^x) - 6 



-1/3, 



P (n 1 / 3 E#(0 O + n _1/3 s) - x ] > a?) 



«S z 



-^P(l7-(z) >s). 

The switching relationship for the limiting process gives that U~ (z) ^ x <=> 
Di,(X a b)(x) < z, with probability one, where L>£(X a) fc)(x) has been defined 
as the left-hand slope of the GCM of X aj b, at a point x < 0. Hence, for 

x < 0, 

y„(i)iDi(i a , t )(x). 

Completely analogous, Yn(x) — ► Z)R(X a! ft)(x), for x > 0. By continuous 
mapping theorem and by (I4.8p . it can be concluded that for fixed x e [—k, k], 

Y n(x) ^ g%(x), 

where g® b has been defined in (|3.32[) . 

Our next objective is to apply Theorem 6.1 in [14]. The first condition 
of Theorem 6.1 is trivially fulfilled. The second condition follows by Lemma 
11 in [T7], while the third condition follows by the definition of the inverse 
processes. Hence, for fixed x, 

P(X n (x) sj z,Y n (x) sc z) -+P(g a>b (x) sj z,g° tb (x) sj z) , 
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for a and b defined in (|4,3p . The arguments for one dimensional marginal 
convergence can be extended to the finite dimensional convergence, as in the 
proof of Theorem 3.6.2 in [2], by making use of Lemma 3.6.10 in [2]. Hence, 
we can conclude that the finite dimensional marginals of the process (X n , Y n ) 
converge to the finite dimensional marginals of the process (g a ,b,9ab)- This 
completes the proof. 

□ 

By making use of results in [T7] , a completely similar result holds in the 
nonincreasing setting. 

LEMMA 4.2. Assume (Al) and (A2) and let x e (0, r#). Suppose that A 
is nonincreasing on [0, oo) and continuously differentiable in a neighborhood 
oJxq, with \q(xq) ^ and X'q(xq) < 0. Moreover, assume that the functions 
x —> Q(/3o,x) and H uc {x), defined in (I4.ip and above (14. lj) . are continuously 
differentiable in a neighborhood of xq. 
Then, for a and b defined in (|4.3p . (X n ,Y n ) converge jointly to (lafiilab) 

in C x C, where the processes l a j, and I® b have been defined in (J3.37P and 
([338]) . 

Subsequently, we state two immediate results, that will be used repeat- 
edly throughout the rest of the paper. 

LEMMA 4.3. Let xo e (0, tjj) fixed and let D n be the set on which the 
unconstrained NPMLE X n , defined in Lemma \3.1\ differs from constrained 
NPMLE X^, defined in Lemma \3. £1 Then, for any e > 0, there exists k £ > 
such that 

liminf P ( D n a [x - n _1//3 /c e ,x + n^^kS) ^ 1 - e. 

Proof. The proof of this fact follows by exactly the same reasoning as in the 
proof of Lemma 2.6 in [T], preprint for [3]. □ 



LEMMA 4.4. Consider the processes X n and Y n defined in (|4.2p . Then, 
for every e > and k > 0, there exists an M > such that 



Similarly, 



limsupP sup |X n (x)| > M\ ^ e. 

n-»oc \xe[-k,k] I 



limsupP sup |y n (x)| > M < e. 

n^>cD \xe[-k,k] J 
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Proof. The monotonicity of the processes X n and Y n yields that 
sup \X n (x)\ =max{\X n (-k)\,\X n (k)\}, 

xe[—k,k] 

sup \Y n (x)\ =msx.{\Y n (-k)\,\Y n {k)\}. 

xe[—k,k] 

Assume |X n (A;)| to be the maximum in the above display. Since for fixed 
k, X n (k) — * g a ,b{k)i with a and b defined in (|4,3|) . it will result that the 



processes X n and Y n in (|4.2p are, with high probability, uniformly bounded. 

□ 

The limiting distribution of the likelihood ratio statistic of a nondecreas- 
ing baseline hazard function Ao is supplied then by the subsequent theorem. 

THEOREM 4.1. Suppose (Al) and (A2) hold and let x e (0,t h ). As- 
sume that Ao is nondecreasing on [0, oo) and continuously differentiate in a 
neighborhood of xq, with Ao(xo) ¥= and X' (xo) > 0. Moreover, assume that 
H uc (x) and x — ► <l>(/3o,x), defined in (|4.ip and above (|4.ip . are continuously 
differentiate in a neighborhood of Xq. Let 21og£ n (#o) be the likelihood ratio 
statistic for testing Hq : Ao(xo) = Oq, as defined in (|3.2p . Then, 

21ogCn(0o)^B, 

where D = \ [(<?i,i (u)) 2 — (<7i i(^)) 2 ] du, with g± : \ and g\ 1 defined in (|3.3ip 
and (I3T32J1 . 

Proof By ([321) and (^20]) . the likelihood ratio statistic 2 log £„(0 O ) = 2Lg n (A n )- 
2La (A°) can be expressed as 

n—X n—l 



21og£ n (0 o ) =2 ^ A « log A„(T (4) ) - 2 £ A (0 log A°(T (i) ) 

n.— 1 n 

- 2 ^ [r (i+ i) - %] [\(T (i) ) - A°(T (i) )] ^ e& 

n 

-2[r (m+1) -xo][A n (r M )-0 o ] 2 eft 



z w 



' z (0 



;=m+i 



n 

2[xo-T (m) ][A n (T (m) )-A°(T (m) )] ^ e ^ 



z (0. 



Z=m+1 
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Let 



n-l 



d-1 



S n = 2 ^ A (i) log A n (r (i) ) - 2 2 A w log A°(T W ), 



(4.9) 



i=l 



i=l 



and denote by Z) n , the set of indices i on which X n (T/{\) differs from A° (T^). 
Hence, expanding both terms of 5 n around Ao(xo) = 6q, we get 



ieD n 



2 *« 



^0 



ieD n 



•f 



with 



ieD„ 



1 



^5 






A°(T (i) )-0 o 



J^m 



Rn = ~ 2 A 



(<)- 



AnC^(i)) — #0 



ieD„ 
#n,l — -Rn.2) 



W ) 



13 



is- 



w 



A°(r (0 ; 



A°*(%) 



13 



where A*(T(j)) is a point between A n (T(j)) and #o an d X^{Tn\) is a point 
between A^(T^) and 6>o- We want to show that JR^i and Rn,2, hence R n 
converge to zero, in probability. As for the R n \ term, it can be inferred that 



fln.il < 



lh 



u e D r , 



n 



vs (\ n (u) - e 



X*(u) 



dP n (u,S,z), 



where D n is the time interval on which A n differs from A^. Choose now 
e > and 7 > 0, and for xq 6 (0, th) fixed and k £ > 0, denote by I n = [xo — 
n~ l l z k £ ,XQ+n~ l ^k E ]. We can write R n> i = R rhl {D n c I n } + R n>1 {D n d: I n }. 



Since, by Lemma 14731 

P(\Rn,l{D n * I n }\ > 7) < P(A» * A*) < £, 

we will further focus on bounding |i? nj i{Z) n c I n }|- By Lemma 14.31 and by 
Lemma l474| there exists k £ > such that supj.gr k e ,k E ] A n (^o + n~ l ' 3 x) — 6q 
is O p (n^ 1 ' 3 ). Furthermore, since 
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sup 
xe[— fc e ,fc £ ] 



A*(x + n 1/3 x) - 9 



^ sup 



X n (x + n 1/3 x) - 9 



it will result that, for u e D n , 



nV3 (X n ( u ) - e ) 



is uniformly bounded 



A,*(u) is uniformly bounded away from zero. It will result that there 



and 



exists M > such that 



\Rn,i\ ^M 5{x - k £ n~ 1/3 ^ u sc x + k £ n~ 1/3 }d (P n - P) (u, S, z) 

+ M 5{x - k £ n~ 1/3 sj u < x + k £ n~ 1/3 }dP(u, 5, z) + o p (l). 

Chebyshev's inequality provides that the first term on the right-hand side 
is O p (n~ 2 ' 3 ). As the function H uc defined above (|4.ip is assumed to be 
continuously differentiable in a neighborhood of xo, the second term on the 
right-hand side is O p {n~ 1 ' 3 ). We can conclude that R n ,i = o p (l). Com- 
pletely similar, by using Lemma 14.31 and Lemma 14.41 it can be shown that 
R n ,2 = o p (l). Thus 2 log £ n (9 ) = A n - B n + o p (l), where 



ieD n 



ieD n \{m} 



(i) 



l=i+l 



n 

2[T (m+1) -x o ][A n (r M )-0o] 2 e/§;Z( ° 



(4.10) 



i=in+i 



2 [x - T (m) ] [A n (r (m) ) - A° (r (m) )] J] (Mo , 



and 



1 



B « = 72 }j A («) 1 ^™( T W) ~ ^° ~~ ^n( r (i)) 



n 2 



U ieD n 

Hence, A n can be written as A n = A n \ — A n 2, where 
2 

and 



(4.11) 



^ rid D 



A (0 - ^o [T (4+1) - r (0 ] 2 e& 

J=i+1 



CO 
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2 r i n 

A n2 =- 2 [x° n (T {i) ) - e \ I A (0 - e [T (4+1) - r (i) ] J] e^ ; 



ieD„\{m} 

2 



■(') 



J=i+1 



A°(T M )-0oJ<A (m) -^h-r (m) ] 2 e ^ 



CO 



For the term A n \, partition the set of indices D n into s consecutive blocks 
of indices B\,...,B S , such that A n is constant on each block. Denote by 
v n j the unconstrained estimator \ n (T(j\), for each % e Bj, with j = 1, . . . , s. 



By (j3T6f) . it follows that 



^ = iES 



? o )JA (i) -0o[T( i+ i)-T (O ] £ e^ 



CO 



l=i+l 



T S «W - flo) 2 A « - *o 2 [r (<+1) - %] ^ e& 



'(0 



i=i 



iG_B, 



ieBj 



l=i+l 



2 (<W - *o) 2 2 [T (i+1) - r (0 ] 2 e& 



-co 






ieBj 



=i+l 



jn J] [A n (r (l) )-^o] 2 i[T (m) -r (4) ] 2 e^ 



ieD„ 



J W 



l=i+l 



Define 



$. 



v (P 1 x) = j{u>x}eP' z dP n (u,6,z), 



and note that 



1 



1 



$ n (/i, «) du = - [r (i+1) - r (0 ] ^ e^ ; 



(0 



'[ T M> T (*+i)) l=i+l 

for each i = l,...,n — 1. The term A n i can then be written as 

2 



A nl = —n 



f r, -|2 

{u e L> n } |A n («) - (9 J $„(/3 n , u) dn, 



(4.12) 



where D n is the interval on which A n and A„ differ. Similarly, for the term 
A n 2, partition D n into q consecutive blocks of indices B®, . . . , B®, such that 
the constrained estimator A„ is constant on each block. There is one block, 
say B®, on which the constrained estimator is 6q, and one block, say B® that 
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contains m. On all other blocks -B?, denote by 02 o - the constrained estimator 
A°(T(j)), for each z e 2?? It will result that, 



*° & 



J=l ieB" 
2 



2=i+l 



2 

iEB°\{m) 



[Vn P ~ ) 



A (4) - O [T (i+1) - r (0 ] g e' 5 - 



(0 



i=i+l 



+ fl-(«Sp-«o)|A (ro) -e [xo-r (m) ] 2 A 



0o 



'(0 



Z=m+1 



t S K* - *>) 2 A « - *° S [ r (*+D - r «] 2 ^ 



'CO 



J7^r,p 



fefl° 



ieB °j 



l=i+l 



J np 



ieBO 



?o) j 2 A (0 - ^o 2 [T (i+1) - T (4) ] 2 e& 



'(I) 



+ [x -T (m) ] 2 e# 



i=t+i 

z, 



(0 



2=m+l 



By ([3TT8]) and (l3~T9|) . 

jVr,p 



».) 



j- o K-0o) 2 



2><i + i)--( ] £<Mo 

Z=i+1 

n 

i0\{m} 2=i+l 

+ [xo-T (m) ] ^ e ^ %) 



feBP 



■(n 



Z=m+1 



"00 



??■ 



2 1 

S [%0)-Ab] -[T {i+1) -T {t) ] 2 <M- 



iGD n \{m} 



i=i+l 



0o 



» 



[^n( T (m)) 



?0 



I 2 1 

n 



N-T (m) ] 2 ^ 

l=m+l 



■ID 



As A°(a?) = A„(T/ m )) on the interval [T/ m j, ceo) and A„(x) = 0o on the interval 
[xo,T( m+1 )), this gives that 
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r T (m+l) - 

JT (rn) 



X° n {u)-9 $ n (l3n,u)du 



f 

J1 (m) 



X° n (u) 



n i 



K( T (m)) -®0 



n2 



n 

[xo-T (m) ] ^ e^ z «. 



X° n {u)-9 $ n {p n ,u)du 



l=m+l 



This leads to 



A n 2 = -Q-n 



and, thus A n in (14.10P can be written as 



{ueL> n } X° n (u)-9 <f> n n ,u)du, 



A, 



-n 



f{« D n ) 



X n {u) 



X V n {u)-9 }^ n (P n ,u)du. 



In a similar manner, B n in (|4.1ip can be expressed as 



B„ 



^nj {ueD n } 



X n (u)-9 - X° n (u)-9 \ dV n {u), 



by (|3.4p and by noting that for every i = l,...,n— 1, 



I 



dF» = V n (T {i+1) ) - V n (T {l) ) = -A w . 

[T (i) ,T (i+1) ) ™ 



Concluding, 

21og£ n (fl )=|-n |{uE5 n } 



A„(u) 



A°(u)-0 o ^„09„,u)du 



~ ^2 n { U e - "} 1 ^«( U ) ~ ^0 



X° n (u)-9 \ dV n (u) + o p (l). 



Let V(x) = \5{u < x}dP(u,5,z), and see that, in fact, V{x) = H uc {x) 
where H uc has been defined above (|4.ip . Thus, 



21og£ n (# )=|-n f{ueL> n } 

~ ~E2 n { u e ^"} j ^™( n ) ~ ^o 
where R n = R n ± — R n 2, with 



X n (u) \x° n (u)-9o\ 2 \<5>(f3 ,u)du 



X° n (u)-9 \ dV(u) + R n + o p (l), 
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Rnl = -K-n 



\{ue D n ) 



X n {u) 



x° n (u) - e 



&n(Pn,u) -*(A),«)J du 



and 

Rn2 



Q2 n \ { u ^D n } 



XJu) 



-\2 



n2 



A>) -0o \d(V n (u)-V(u)). 



The aim is to show that R n \ and R n 2, and thus R n is o p (l). The term i? n i 
can be written as 



ln^J{ueD B } 



V 3 (X(u) - O ) 



h2 



*»(Ai,«) -$(A),«)) du. 



Lemma 4 in [T7] provides that 



sup 



$„(£„, a;) -$(/3 ,x) 



with probability one. From Lemma 14.41 and since §{u e D n } du < 2£; £ ?i~ 1 ' 3 , 
by Lemma 14.31 and by using similar arguments as for the term R n< i, we can 
conclude that R n \ is o p (l). Analogously, 



Rn2 = ^n 1 / 3 f {u e £„} | [n 1 / 3 (\(u) - O ) 

n 1 / 3 (A° (u) - (9 ) j 2 U d(P n - P) (u, 5, z) . 
Once more, by Lemma 14.31 and Lemma 14.41 there exists M2 > such that 

Juf2 r 

\R\12\ ^ -^n 1 / 3 5{ueD n } d(P n - P)(u, S, z), 

with arbitrarily large probability. Chebyshev's inequality along with the 
same reasoning as for the term R Hj i provides that R n 2 = o p (l). Hence, 
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X n (u)-9o - X° n (u) 



*(#,,«) d(u) 



x n (u)-e - A» 



dF(tt) + Op(l). 



21og£ n (# )=^-n f{ue A»} 

Consider the change of variable x = n 1 ' 3 (n— rco) and let D n = ji 1 ' 3 (l) n — xo) • 
This yields that 

2i og a#o) =r \{ xe 5n l [ x ' (x) _ y " 2(2:) ] $ ^°' x ° + n " 1/3x ) dx 

-^[{^4} [X 2 (x) - y n 2 (x) 2 ] V"(x + ™~ 1/3 *) dx + op(l) 
= |-cJ>(/?o,x ) J {x e 5 n } [X 2 (x) - y n 2 (x)] dx 

" ^'faO J {x £ 5 n } [X 2 (x) - Y n 2 (x)] dx + op(l). 



As inferred in IT 



A (x) 



d1/(x)/dx 



$(A),s) ' 
which gives that 

21og£ n (0 o ) = ^(/3o,x ) J {x e D n } [X 2 (x) - i; 2 (x)] dx + o p (l). 

Thus 

21og£„(0 o ) = ^ J {x e 5 n } [X 2 (x) - y n 2 (x)] dx + 0p (l), 

where a has been defined in (|4.3p . From Lemma 14.31 for every e > 0, we 
can find an interval [— k e ,k e ] such that TP(D n c [ — fce>fce]) > 1 — £, for n 
sufficiently large. In order to prove the theorem, we apply Lemma 4.2 in [19] . 
by taking 

Qn =^\{ Xe Dn} [X 2 n {x) - y„ 2 (x)] dx, 

Q n£ =1 J{x e [-At, fc £ ]} [X 2 (x) - y n 2 (x)] dx, 

Q £ = ^2 { X e ["^Ml (9a,b(x)) 2 ~ (s^fet^)) dx, 
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and 



Q 



a 2 ) 



x e D 



a,bl 



{9a, b {x)? - (g%(x)Y 



dx. 



where D a b denotes the set on which g a ^ and g® b differ. Condition (i) in 
Lemma 4.2 of Prakasa Rao follows by Lemma 14.31 In addition, Lemma 
4.31 and Lemma 14.11 yield condition (ii), since for every e > 0, we can 
find k £ > such that P(D a ,6 c [— k e ,k £ ]) > 1 — £■ The third condi- 
tion follows, for every fixed e, by Lemma 14.11 and by continuous map- 
ping theorem. Namely, (X n ,Y n ) => {g a ,b,9ab) as a process in £ x £ and 
(/, g) <->■ ^{x e [— c, c]}(/ 2 (x) — g 2 (x)) dx is a continuous function defined on 
£ x £ with values in R. Conclusively, 



^J>n(*) 



r„ 2 (i)] ied„ dx 



x e D a h} dx, 



J[( 5l , 1 ( 3 ;)) 2 -( 5 ? il ( 2; )) 2 j{xe J D 1 , 1 }dx 



by continuous mapping theorem and by Brownian scaling, as derived in [5]. 
This completes the proof. □ 

The asymptotic distribution of the likelihood ratio statistic in the non- 
increasing baseline hazard setting can be derived completely analogous. 

THEOREM 4.2. Suppose (Al) and (A2) hold and let x e {0,t h ). As- 
sume that Xq is nonincresing on [0, oo) and continuously differentiate in a 
neighborhood of xq, with Xq(xq) =£ and Aq(xo) < 0. Moreover, assume that 
H uc {x) and x — > $(/3o,x), defined in (|4.ip and above (|4.ip . are continuously 
differentiable in a neighborhood of xq. Let 21og£ n (#o) be the likelihood ratio 
statistic for testing Hq : Aof^o) = 0q, as defined in (|3.2p . Then, 



21og£„( 



D. 



Proof. Following the same reasoning as in the proof of Theorem 14.11 and by 
Lemma 14.21 it can be deduced that 



21og£n(#0) ^ ^2 \ [(M*)) 2 " i l a,b( X )) 2 ] { X e D a,b} 6x, 

where Z) a ,b is the set on which l a ^ and l° a b differ. By continuous map- 
ping theorem, it suffices to show that, for t fixed, l a ,b{^a,b){t) h as the same 
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distribution as g a ,b(X a ,b)(t) and l^ b {X a>b )(t) has the same distribution as 
g® b (X a>b )(t). It is noteworthy that 

slolcm(X aib ) (t) = -slogcm(-X a>6 )(t). 

Thus, by Brownian motion properties and continuous mapping theorem, 

P {la,b(t) < z) = p (-slogmc(-oW(t) + t 2 ) sc z) 

= P (-slogmc(oW(t) + t 2 ) < z) = P {-g a , b (t) sc z) . 

Concluding, la,b(X a ,b)(t) = — 9a,b(X a ,b)(t), and a similar reasoning can be 
applied to show that I® b (X atb )(t) = — g® b (X a ^ b )(t). The proof is then imme- 
diate, by continuous mapping theorem. □ 

REMARK 4.1. The same limiting distribution D is obtained for the loglik- 
leihood ratio statistic in the absence of covariates in J^, as well as in other 
censoring frameworks, as derived in \B$. In fact, it has been shown in J3f 
that the same holds true for a wide class of monotone response models. This 
distribution differs from the usual x 2 distribution, that is obtained in the reg- 
ular parametric setting. It is noteworthy that D does not depend on any of 
the parameters of the underlying model, and this property turns out to be 
particularly useful in constructing confidence intervals for the parameters of 
interest, as it will be exposed in the subsequent section. 

5 Pointwise confidence intervals via simulations 

Once having derived the asymptotic distribution of the likelihood ratio 
statistic, the practical application at hand is to construct, for fixed xq e 
(0,th), pointwise confidence intervals. We will derive such intervals, for 
a nondecreasing baseline hazard function Ao, evaluated at a fixed point xq,, 
based on simulated data and compare these intervals with the intervals based 
on the asymptotic distribution of the nondecreasing NPMLE A n . According 
to Theorem 2 in jTTj, for fixed xq, 

1/3 (\ l \ \ I \\ d f iX o( x o)K( x o)\ 1/3 ■ (tmtu-s , ,2i 

n /J Xn(x ) - Ao(x -> — — - — V- — argmmjW (t) + t } 

V J \ $(/3o,3:o) / xeTR. 

s C(x )Z, 

where W is standard two-sided Brownian motion starting from zero, and 
the constant C{xq) depends on xo and on the underlying parameters. An 

3(3 



estimator C n (xo) of C(xq) will then yield an 1 — a confidence interval for 
AoOo) 



r* 1 



X n (x ) - n-y 3 C n {x )q(^, 1 - a/2), \ n {x ) + n^ 3 C n {x )q{'E, 1 - a/2) 



where q(Ti, 1 — a/2) is the (1 — a/2) quantile of the distribution 7L. These 
quantiles have been computed in |13| . and we will further use q{Ti, 0.975) = 
0.998181. For simulation purposes, we propose 

/ „ „ \ 1/3 

Cn(soH 4 ^ ( *° R( * 0) ) , 

\ $ n (^ n ,X ) / 

where $ n (j3,x) has been defined in (|4.12|) . and /3 n is the maximum partial 
likelihood estimator. Lemma 4 in [T7] ensures that 3> n (/3 n , •) is a strong 
uniform consistent estimator of $(/3g,-). Furthermore, as an estimate for 
Aq(xo), we choose the numerical derivative of X n on the interval that contains 
xq, that is, the slope of the segment [A n (T( m )), A n (T( m+1 ))]. 

Pointwise confidence intervals for Xq(xq) can also be constructed by mak- 
ing use of Theorem 14. 11 Let 2 log £ n (6) denote the likelihood ratio for testing 
Hq : Ao(£o) = Q versus H\ : Ao(xo) ¥" 0. A 1 — a confidence interval is then 
obtained by inverting the likelihood ratio test 21og£ n (#) for different values 
of 9, namely 

C£ )a s{0 : 2tog£„(0)<c7(B,l-a)}, 

where q(D, 1 — a) is the (1 — a) th quantile of the distribution D. Quantiles 
of D, based on discrete approximations of Brownian motion, are provided 
in [6], and we will make use of q(JD, 0.95) = 2.286922. The parameter 9 
is chosen to take values on a fine grid between and 6. It can be shown 
immediately that, for large enough n, the coverage probability of C\ a is 
approximately 1 — a. 

For the performance analysis, we have constructed and compared, from 
simulated data, the confidence intervals C\ a and C^ Q , for a = 0.05 and 
various n. We will assume a Weibull baseline distribution function for the 
event times, with shape parameter 2 and scale parameter 1. For simplicity, 
we will assume that the covariate is single-valued and uniformly (0, 1) dis- 
tributed and take /3o = 0.5. Given the covariate, the censoring times are 
assumed to be uniformly (0, 1) distributed. We will choose xo = Viog - ^, the 
median of the baseline distribution of the event times. For each chosen sam- 
ple size, we generate 1000 replicates and compute the empirical coverage and 
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the average length of the corresponding confidence intervals. Furthermore, 
since we are simulating from a Weibull distribution with shape parameter 2 
and scale parameter 1, and hence know the true baseline hazard function Ao 
and its derivative, as well as the true underlying regression coefficient, we 
could also consider a confidence interval C\ a , given by 



C 



X n (x ) - n-^CoOroMZ, 1 - a/2), X n (x ) + n^ 3 C (x )q(Z, 1 - a/2) 



where Cq is a deterministic function given by 



Co(x ) 



AvX (x )X' (x ) \ 



1/3 



Table [T] reveals the performance, for various sample sizes, of the confi- 
dence interval C\ 05 based on the likelihood ratio method (LR) , the con- 
fidence interval C^ 005 , based on the asymptotic distribution (AD) of the 

scaled differences between the NPMLE A n and the true baseline hazard at 
a fixed point, as well as the confidence interval C\ 05 based on the known 
Weibull distribution (TD). 





LR 


AD 


TD 


n 


AL 


CP 


AL 


CP 


AL 


CP 


50 


4.275 


0.917 


5.203 


0.932 


1.506 


0.964 


100 


3.837 


0.923 


4.838 


0.941 


1.317 


0.953 


200 


3.009 


0.931 


4.605 


0.947 


1.247 


0.947 


500 


2.734 


0.947 


3.372 


0.948 


0.961 


0.964 


1000 


1.454 


0.942 


2.259 


0.940 


0.713 


0.957 


5000 


0.879 


0.945 


1.768 


0.952 


0.546 


0.953 



Table 1: Simulaton results for constructing 95% pointwise confidence inter- 
asymptotic distribution of 
(TD), in terms of average 



vals using the likelihood ratio C\ 05 (LR) or the asymptotic distribution of 



the NPMLE estimator C£ >0>05 (AD) and C^ om 
length (AL) and empirical coverage (CP). 
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It is noteworthy that for each sample size, the likelihood ratio method 
yields, on average, shorter pointwise confidence intervals in comparison with 
the confidence intervals based on the asymptotic distribution of the NPMLE 
estimator A n . Moreover, the confidence intervals based on the likelihood 
ratio exhibit comparable coverage probabilities with the confidence intervals 
C^o.o5; based on the asymptotic distribution. As expected, the highest 
coverage rate is attained by the confidence intervals C^ 005 . Furthermore, 
they also yield confidence intervals with the shortest length, on average. 
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